Home Projects JIIT Placement Alerts Architecture & Design Database Schema & Data Model Collection Schemas & Field Definitions Jobs Collection

Jobs Collection

Referenced Files

superset_client.py database_service.py db_client.py structured_job_listings.json placement_offers.json

Table of Contents#

Introduction
Project Structure
Core Components
Architecture Overview
Detailed Component Analysis
Dependency Analysis
Performance Considerations
Troubleshooting Guide
Conclusion

Introduction#

This document defines the Jobs collection schema used to store structured job profile data extracted from the SuperSet portal. It explains the job_id unique identifier field and its relationship to MongoDB’s ObjectId, details the company and job_profile fields, job_description content storage, and the qualification_criteria embedded structure containing min_cgpa threshold, branches array, and batch_years array. It also covers the position_details structure with total_positions, job_location, and job_type enumeration, the compensation embedded document with base_salary, bonus, and currency fields, and timestamps for application_deadline, posted_at, and metadata timestamps. Validation rules, array field requirements, and example documents are provided to illustrate different job types and qualification criteria combinations.

Project Structure#

The Jobs collection is part of the MongoDB database managed by the application. The schema is defined in the client layer and persisted through the database service.

graph TB subgraph "Application Layer" SC["SupersetClientService
Fetches and structures job data"] DS["DatabaseService
Upserts Jobs to MongoDB"] DC["DBClient
Provides MongoDB collections"] end subgraph "Database" JC["Jobs Collection
Schema: job_id, company, job_profile,
qualification_criteria, position_details,
compensation, timestamps"] end SC --> DS DS --> DC DC --> JC

Diagram sources

Section sources

Core Components#

The Jobs collection schema is defined by the Job model and stored in MongoDB. The schema fields and their types are derived from the Job model and verified against sample data.

job_id: String (unique identifier for the job)
company: String
job_profile: String
qualification_criteria: Embedded document with:
- min_cgpa: Number (float)
- branches: Array of strings
- batch_years: Array of numbers
position_details: Embedded document with:
- total_positions: Number
- job_location: String
- job_type: Enumerated string
compensation: Embedded document with:
- base_salary: Number
- bonus: Number
- currency: String
application_deadline: Number (epoch milliseconds)
posted_at: Number (epoch milliseconds)
metadata timestamps: saved_at, updated_at

Validation rules and array requirements:

Arrays must not be empty for branches and batch_years
Minimally one of min_cgpa, branches, or batch_years must be specified
job_type must be one of the enumerated values
application_deadline must be greater than posted_at if both are present

Section sources

Architecture Overview#

The Jobs collection is populated by extracting job data from SuperSet, structuring it into the Job model, and persisting it to MongoDB via the DatabaseService.

sequenceDiagram participant SS as "SupersetClientService" participant API as "SuperSet API" participant DS as "DatabaseService" participant DB as "MongoDB Jobs Collection" SS->>API : Fetch job listings API-->>SS : Raw job data SS->>SS : Structure into Job model SS->>DS : Upsert structured job DS->>DB : Insert or replace job document DB-->>DS : Acknowledgment DS-->>SS : Success status

Diagram sources

Detailed Component Analysis#

Job Model Definition#

The Job model defines the schema for storing job data. It includes identifiers, descriptive fields, embedded qualification criteria, position details, compensation, and timestamps.

classDiagram class Job { +string id +string job_profile +string company +int placement_category_code +string placement_category +string content +int createdAt +int deadline +EligibilityMark[] eligibility_marks +string[] eligibility_courses +string[] allowed_genders +string job_description +string location +float package +string annum_months +string package_info +string[] required_skills +string[] hiring_flow +string placement_type +Document[] documents } class EligibilityMark { +string level +float criteria } class Document { +string name +string identifier +string url } Job --> EligibilityMark : "contains" Job --> Document : "contains"

Diagram sources

superset_client.py

Section sources

superset_client.py

Jobs Collection Schema#

The Jobs collection schema is derived from the Job model and validated against sample data. The schema includes the following fields:

job_id (String): Unique identifier for the job
company (String): Name of the company
job_profile (String): Title of the job
qualification_criteria (Embedded Document):
- min_cgpa (Number): Minimum cumulative grade point average
- branches (Array of Strings): Eligible academic branches
- batch_years (Array of Numbers): Eligible batch years
position_details (Embedded Document):
- total_positions (Number): Total number of positions
- job_location (String): Location of the job
- job_type (Enumerated String): Type of job (e.g., full-time, internship)
compensation (Embedded Document):
- base_salary (Number): Base salary amount
- bonus (Number): Bonus amount
- currency (String): Currency code
application_deadline (Number): Application deadline in epoch milliseconds
posted_at (Number): Posted timestamp in epoch milliseconds
metadata timestamps:
- saved_at (Number): Timestamp when the document was saved
- updated_at (Number): Timestamp when the document was last updated

Validation rules:

Arrays branches and batch_years must not be empty
At least one of min_cgpa, branches, or batch_years must be specified
job_type must be one of the enumerated values
application_deadline must be greater than posted_at if both are present

Section sources

Example Documents#

Below are example documents illustrating different job types and qualification criteria combinations:

Example 1: Full-time job with CGPA threshold and branch eligibility { “job_id”: “7d7dd5e9-51e6-46b6-a0e2-c8cabf06acdc”, “company”: “Axeno”, “job_profile”: “Software Intern”, “qualification_criteria”: { “min_cgpa”: 7.0, “branches”: [“B.Tech - CSE”, “M.Tech. - CSE”], “batch_years”: [2026] }, “position_details”: { “total_positions”: 5, “job_location”: “Noida”, “job_type”: “full-time” }, “compensation”: { “base_salary”: 600000, “bonus”: 0, “currency”: “INR” }, “application_deadline”: 1755751008000, “posted_at”: 1755688649000, “metadata”: { “saved_at”: 1755688649000, “updated_at”: 1755688649000 } }

Example 2: Internship with multiple branch eligibility and CGPA thresholds { “job_id”: “8c8530ea-07d6-4da1-81a7-595412905513”, “company”: “Oracle Financial Services Software Limited (OFSS)”, “job_profile”: “Associate Consultant”, “qualification_criteria”: { “min_cgpa”: 7.0, “branches”: [“M.Tech. - CSE”, “B.Tech - IT”], “batch_years”: [2026] }, “position_details”: { “total_positions”: 10, “job_location”: “Bengaluru, Mumbai, Pune or Chennai”, “job_type”: “internship” }, “compensation”: { “base_salary”: 982054, “bonus”: 85100, “currency”: “INR” }, “application_deadline”: null, “posted_at”: 1755676866000, “metadata”: { “saved_at”: 1755676866000, “updated_at”: 1755676866000 } }

Example 3: Remote job with branch and batch eligibility { “job_id”: “9b2d06d3-37d7-49ee-92cb-c161f8f6c8c1”, “company”: “Recruit CRM”, “job_profile”: “Customer Success—Associate”, “qualification_criteria”: { “min_cgpa”: 5.0, “branches”: [“M.Tech (Integrated) - CSE”, “B.Tech - CSE”], “batch_years”: [2026] }, “position_details”: { “total_positions”: 8, “job_location”: “Remote”, “job_type”: “full-time” }, “compensation”: { “base_salary”: 800000, “bonus”: 0, “currency”: “INR” }, “application_deadline”: 1755765057000, “posted_at”: 1755674049000, “metadata”: { “saved_at”: 1755674049000, “updated_at”: 1755674049000 } }

Section sources

structured_job_listings.json

Data Persistence Flow#

The Jobs collection is persisted through the DatabaseService, which handles upsert operations and maintains metadata timestamps.

flowchart TD Start(["Upsert Structured Job"]) --> Validate["Validate job_id presence"] Validate --> Exists{"Job exists?"} Exists --> |Yes| Update["Merge fields and update updated_at"] Exists --> |No| Insert["Insert with saved_at and updated_at"] Update --> Done(["Return success"]) Insert --> Done

Diagram sources

database_service.py

Section sources

database_service.py

Dependency Analysis#

The Jobs collection depends on the Job model and is persisted via the DatabaseService and DBClient.

graph TB JM["Job Model
superset_client.py"] DS["DatabaseService
database_service.py"] DC["DBClient
db_client.py"] JC["Jobs Collection
MongoDB"] JM --> DS DS --> DC DC --> JC

Diagram sources

Section sources

Performance Considerations#

Indexing: Create indexes on frequently queried fields such as job_id, company, and job_profile to improve query performance.
Field Selection: Use projection to limit returned fields when querying large collections.
Pagination: Implement pagination for listing jobs to avoid loading excessive data.
Batch Operations: Use bulk write operations when inserting or updating multiple job documents.

Troubleshooting Guide#

Common issues and resolutions:

Missing job_id: Ensure job_id is present before upserting to avoid errors.
Duplicate job entries: Use job_id as the unique identifier to prevent duplicates.
Invalid arrays: Ensure branches and batch_years arrays are not empty and contain valid data.
Incorrect timestamps: Verify that application_deadline is greater than posted_at if both are present.
Database connectivity: Confirm MongoDB connection and collection initialization.

Section sources

Conclusion#

The Jobs collection schema provides a structured representation of job profiles extracted from SuperSet, enabling efficient storage, querying, and notification workflows. By adhering to the defined schema and validation rules, the system ensures data consistency and supports robust job posting and filtering capabilities.

Previous Collection Schemas & Field Definitions

Next Notices Collection

JIIT Placement Alerts

Architecture & Design

Core Services

Data Processing & Content Extraction

Server Components

Jobs Collection

Table of Contents#

Introduction#

Project Structure#

Core Components#

Architecture Overview#

Detailed Component Analysis#

Job Model Definition#

Jobs Collection Schema#

Example Documents#

Data Persistence Flow#

Dependency Analysis#

Performance Considerations#

Troubleshooting Guide#

Conclusion#